A Semantics-based English-Bengali EBMT System for translating News Headlines

نویسندگان

  • Diganta Saha
  • Sivaji Bandyopadhyay
چکیده

The paper reports an Example based Machine Translation System for translating News Headlines from English to Bengali. The input headline is initially searched in the Direct Example Base. If it cannot be found, the input headline is tagged and the tagged headline is searched in the Generalized Tagged Example Base. If a match is obtained, the tagged headline in Bengali is retrieved from the example base, the output Bengali headline is generated after retrieving the Bengali equivalents of the English words from appropriate dictionaries and then applying relevant synthesis rules for generating the Bengali surface level words. If some named entities and acronyms are not present in the dictionary, transliteration scheme is applied for obtaining the Bengali equivalent. If a match is not found, the tagged input headline is analysed to identify the constituent phrase(s). The target translation is generated using English-Bengali phrasal example base, appropriate dictionaries and a set of heuristics for Bengali phrase reordering. If the headline still cannot be translated using example base strategy, a heuristic translation strategy will be applied. Any new input tagged headline along with its translation by the user will be inserted in the tagged Example base after generalization.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Phrasal EBMT System for Translating English to Bengali

The present work describes a Phrasal Example Based Machine Translation system from English to Bengali that identifies the phrases in the input through a shallow analysis, retrieves the target phrases using a Phrasal Example base and finally combines the target language phrases employing some heuristics based on the phrase ordering rules for Bengali. The paper focuses on the structure of the nou...

متن کامل

Example-Based Machine Translation for Low-Resource Language Using Chunk-String Templates

Example-Based Machine Translation (EBMT) for low resource language, like Bengali, has low-coverage issues, due to the lack of parallel corpus. In this paper, we propose an EBMT for low resource language, using chunk-string templates (CSTs) and translating unknown words. CSTs consist of a chunk in source-language, a string in target-language, and word alignment information. CSTs are prepared aut...

متن کامل

Contrastive Analysis of Political News Headlines Translation According to Berman’s Deformative Forces

The present research aimed at investigating the deformation of political news headlines translation between English and Persian News Agencies based on Berman`s deformative system. For this purpose, 100 news headlines in English were selected from BBC, Reuters, Associated Press, France, France 24, Financial Times, Business Times, New York Times, Politico, Guardian, CNN, Bloomberg, Middle East Ey...

متن کامل

Example Based English-Bengali Machine Translation Using WordNet

In this paper we propose an architecture of EnglishBengali Example Based Machine Translation (EBMT) using WordNet. The proposed EBMT system has five steps: 1) Tagging 2) Parsing 3) Prepare the chunks of the sentence using sub-sentential EBMT 4) Using an efficient adapting scheme, match the sentence rule 5) Translate from Source Language (English) to Target Language (Bengali) in the chunk and ge...

متن کامل

Analyzing Emotion in Blog and News at Word and Sentence Level

Emotion is crucial to identify as it is not open to any objective observation or verification. In this paper, emotion analysis on blog texts has been carried out for a less privileged language, Bengali and the same system has been applied on the English SemEval 2007 affect sensing corpus containing only news headlines. A set of six emotion tags, namely, happy, sad, anger, fear, disgust and surp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005